ALMOST HADAMARD MATRICES: THE CASE OF ARBITRARY 

EXPONENTS 



TEODOR BANICA AND ION NECHITA 

Abstract. In our previous work, we introduced the following relaxation of the Hada- 
mard property: a square matrix H <E Mat(R) is called "almost Hadamard" if U = H/y/N 
is orthogonal, and locally maximizes the 1-norm on 0(N). We review our previous re- 
sults, notably with the formulation of a new question, regarding the circulant and sym- 
metric case. We discuss then an extension of the almost Hadamard matrix formalism, by 
making use of the p-norm on O(N), with p £ [1, oo] — {2}, with a number of theoretical 
results on the subject, and the formulation of some open problems. 
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Introduction 

An Hadamard matrix is a square matrix H G Mjv(±1) having its rows pairwise or- 
thogonal. The Hadamard conjecture (HC), which is over a century old, states that such 
matrices exist, at any N £ 4N. See [T], [IB], [2D], The circulant Hadamard conjecture 
(CHC), which is half a century old [22], states that a circulant Hadamard matrix can exist 
only at iV = 4. More precisely, only the following matrix K4 and its various "conjugates" 
can be at the same time circulant and Hadamard, regardless of the size JVeN: 
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An interesting generalization of the Hadamard matrices are the complex Hadamard 
matrices, namely the matrices H G Mjy(T), where T is the unit circle, having their rows 
pairwise orthogonal. These matrices appear in several contexts, see [H], [T7], [IB], [21], 
|28j . [30] . |31j . The main example is the rescaled Fourier matrix (w = e 2m / N ): 
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This example prevents the existence of a complex analogue of the HC. However, when 
trying to build complex Hadamard matrices with roots of unity of a given order, a subtle 
generalization of the HC problematics appears [10], [21], [2Z, 
CHC, there has been some interesting work here on the circulant case [B], [S] 
much work has gone into various geometric aspects, see [2], [0], [IS], [2Z], [2S]- 

Yet another generalization comes from [3J, [3]. The original observation from [3] is that 



In relation now with the 

m. Also, 



for an orthogonal matrix U G O(N) we have ||Z7||i < WiV, with equality if and only if 



H — v iVZ7 is Hadamard. This follows indeed from the Cauchy-Schwarz inequality: 
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This simple fact suggests that a natural and useful generalization of the Hadamard 
matrices are the matrices of type H = y/NU, with U G O(N) being a maximizer of the 
1-norm. However, since such matrices are quite difficult to approach, most efficient is to 
study first the matrices of type H = yNU, with U G O(N) being just a local maximizer 
of the 1-norm. Such matrices are called "almost Hadamard" . See [1] . 

One key feature of the almost Hadamard matrices is that at the level of examples we 
have a number of infinite series, uniformly depending on N G N. The basic example is: 
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\ 2 2 ... 2-NJ 

Observe that is circulant, and that K 4 is Hadamard. Thus we are quickly led into 
the circulant Hadamard matrix problematics, and we have the following questions: 

Problem. What are the circulant Hadamard matrices? The circulant complex Hadamard 
matrices? The circulant almost Hadamard matrices? 

More precisely, the CHC states that there are exactly 8 circulant Hadamard matrices, 
namely and its conjugates. Regarding the second question, Haagerup has shown in 
[T5] that for N = p prime, the number of circulant complex Hadamard matrices, counted 
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with certain multiplicities, is exactly (^f), and the problem is to see what happens when 
N is not prime. As for the third question, this appears from our previous work [1]. 

o 




Figure 1. The Fano plane 



Regarding this latter question, it was shown in [I] that we have a number of interest- 
ing examples coming from block designs [11], [26]. The simplest one, coming from the 



adjacency matrix of the Fano plane 
y = 2 + 3V2: 



[see the Figure), is as follows, with 
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Now back to the above 3 questions, the point is that, from the point of view of Fourier 
analysis, these are all related. Indeed, with F = / vN, the circulant unitary matrices 
are precisely those of the form U = FQF* with Q belonging to the torus formed 
by the diagonal matrices over T. So, in view of the above-mentioned remark about the 
1-norm, all the above questions concern the understanding of the following potential: 

$ : -> [0, oo) 
Q^\\FQF*\\! 

With this approach, the first thought goes to the computation of the moments of $. 
Indeed, the global maximum, or more specialized quantities such as the exact number of 
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maxima, can be recovered via variations of the following well-known formula: 

max($) = lim ( / $ fc 

Of course, in respect to the above problems, one has to restrict sometimes attention to 
the torus T n C T N , with n = 2 J + 1, coming from the orthogonal matrices. 

The origins of this approach go back to [5], where the potential $(t/) — ll^lli was 
investigated over the group O(N), in connection with the HC. Of course, the computation 
of moments over O(N) is a quite complicated question [5], [12]. In the circulant case, 
however, the parameter space being just T , the integration problem is much simpler. 
But it still remains very complicated, and we have no concrete results here so far. 

So, instead of trying to understand the potential $ : T N — > [0, oo) directly via its 
moments and analysis, we should rather try to first develop a few geometric techniques. 
The point is that $ has a number of symmetries, and when investigating these symmetries, 
the lattice {±1}^ C coming from the self-adjoint matrices seems to play a key role. 

More precisely, we will study here the circulant and symmetric orthogonal matrices, 
which correspond via Fourier transform to the sequences a G {±1}^ satisfying on = 
Our result here, motivated by the "almost Hadamard" problematics, is as follows: 

Proposition. Any circulant and symmetric matrix U G O(N)* is a critical point of all 
p-norms on O(N). The local maximizers of the 1-norm can be counted up to N = 30. 

In this statement, O(N)* C O(N) is the set of orthogonal matrices having nonzero 
entries. For more comments on this result, we refer to the body of the paper. 

Back to the general case now, one observation from [3] is that one can replace the 
1-norm by the p-norm, for any p ^ 2. Indeed, at p < 2 the Holder inequality gives: 

\\u\\ P = /P <n 2/p -' (xx) ' =n**-w 

Thus for U G O(N) we have \ \U\\ P < N 2 ^^ 1 ^ 2 , with equality if and only if the rescaled 
matrix H = \/NU is Hadamard. In the p > 2 case a similar result holds, with the Holder 
inequality applying in the reverse sense, and giving the estimate \ \U\\ P > N 2 / p ~ 1/2 . 

So, we are led to the following notion, generalizing those in [3], jl]: 

Definition. A square matrix M G M^(R) is called u p-almost Hadamard" if the rescaled 
matrix U = H/yfN is orthogonal, and is a local extremum of the p-norm on O(N). 

Here by "local extremum" we mean a local maximum at p < 2, and a local minimum 
at p > 2. Also, we will call H "optimal" if U = H\fN is a global maximum/minimum. 
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One interest in this generalization comes from the exponent p — 4, believed to be of 
interest in relation with quantum physics questions. Indeed, for U G O(N) we have: 

dns ((OS), Jn) 2 = J2 ( U i - jf) 2 = I Ml* " 1 

ij ^ ' 

This computation shows that an orthostochastic matrix A is "almost flat", in the 
sense that it minimizes the Hilbert-Schmidt distance to the flat matrix Jjv, if and only if 
Aij = Ufj, with U being a global minimizer of the 4-norm on O(N). See [7], [13] . 

We will prove here that the p-almost Hadamard matrices can be detected by using 
linear algebra. We conjecture that the basic matrix always has this property. 

The paper is organized as follows: in 1-2 we review the material in jl], and we discuss 
some new questions in the circulant case, and in 3-4 we perform some differential geometry 
computations, and we present a list of questions, that we don't know how to solve. 

Acknowledgements. The present article is part of a series started in [3], in collaboration 
with Benoit Collins and Jean-Marc Schlenker, and recently continued in jl], in collabora- 
tion with Karol Zyczkowski. It is our pleasure to thank all three for endless discussions 
and encouragements, and particularly Karol Zyczkowski for numerous recent discussions 
on the subject. The work of I.N. was supported by the ANR grant BS01 008 01. 



1. Almost Hadamard matrices 

We consider in this paper various square matrices M G Mjy(C). The indices of our 
matrices will vary in {0, 1, . . . , N — 1}, and will be sometimes taken modulo N. 

As explained in the introduction, a direct application of the Cauchy-Schwarz inequality 
shows that for O(N) we have ||?7||i < Ny/~N, with equality if and only H = \/~NU is 
Hadamard. In jl] we have introduced the notion of almost Hadamard matrix: 

Definition 1.1. An "almost Hadamard" matrix is a square matrix H G Mn(M) such that 
U = H/yN is orthogonal, and is a local maximum of the 1-norm on O(N). Equivalently, 
Uij 7^ 0, and the matrix SU l , with = sgn(£/y) ; must be positive definite. 

In this statement the equivalence between the two conditions comes from a number of 
differential geometry computations, for which we refer to j3], or jl]. 

As a first remark, any Hadamard matrix is almost Hadamard. In particular, given a 
number iV G {2} U 4N where the Hadamard Conjecture holds, the Hadamard matrices of 
order iV are precisely the almost Hadamard matrices H G Mjv(K) which are "optimal", 
in the sense that U = H/y/N is a global maximum of the 1-norm on O(N). 

The above definition provides a useful, flexible generalization of the quite rigid class 
formed by the Hadamard matrices. For instance at any iV > 3 we have a number of 
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concrete examples, which can be used for various purposes. The basic example is: 

(2-N 2 ... 2 \ 
1 2 2-N ... 2 

Kn = 7n ... 

\ 2 2 ... 2-NJ 

This matrix has several remarkable properties, for instance it is at the same time 
circulant and symmetric. Also, it has at most 2 entries, which are both nonzero. 

So, let us look now more in detail at the matrices having similar properties. We recall 
from [1] that an (a, b, c) pattern is a matrix M G M^(x, y), with N = a + 2b + c, such 
that any two rows of our matrix look as follows, up to a permutation of the columns: 

X . . . X X ... X 

. ._x y . . .y 

b 

Observe that the above matrix comes from a (0, 1, N — 2) pattern. There are many 
other examples, the main result here being that the adjacency matrix of any (N, a + b,a) 
symmetric balanced incomplete block design is an (a, b, c) pattern. See [I]. 

The following result was proved in [3]: 

Proposition 1.2. Let U = U(x,y) be orthogonal, coming from an (a,b,c) pattern. 

(1) U is a critical point of the 1-norm on O(N). 

(2) H = y/NU is almost Hadamard iff (N(a - b) + 26)|ar| + (N(c - b) + 2b)\y\ > 0. 
Proof. Since any row of U consists of a + b copies of x and b + c copies of y, we get: 

b)\x\ + (b + c)\y\ (i = j) 




(SU 



a.) 




-b)\x\ + {c-b)\y\ (i^j) 

Thus SU 1 is symmetric, and by [3] our matrix U is a critical point of the 1-norm. 
Regarding now the second assertion, observe first that we can write SU* as follows: 

SU* = 26(|ar| + |y|)ljv + ((a-6)|a;| + (c-6)|i/|)iVJjv 

= 26(|x| + |y|)(ljv - Jn) + ((N(a - b) + 26)|z| + (N(c - b) + 2b)\y\))J N 

Now since ljv — Jn, Jn are orthogonal projections, we have SU* > if and only if the 
coefficients of these matrices are both positive, and this gives the result. □ 

Let us go back now to our observation that is at the same time circulant and 
symmetric, and look in detail at the matrices having these two properties. We fix iV G N, 
and we denote by F = F N G U(N) the Fourier matrix, F = (w^)/yN with w = e 2m ^ N . 
Also, given a vector q G C N , we associate to it the diagonal matrix Q = diag(g , • • • , Qn-i)- 

Lemma 1.3. For a matrix H G Mtv(C), the following are equivalent: 
(1) H is circulant, i.e. Hij = 7 3 -_i, for a certain vector 7 G C . 
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(2) H is Fourier- diagonal, i.e. H = FDF* , with D G M/v(C) diagonal. 
In addition, if so is the case, then with D = \/NQ we have 7 = Fq. 

Proof. This result is well-known, and the proof goes as follows: 

(1) ==>- (2) The matrix D = F*HF is indeed diagonal, given by: 

kl r 

(2) ==>■ (1) The matrix H = FDF* is indeed circulant, given by: 

k k 

Finally, the last assertion is clear from the above formula of H^. □ 

The following result is as well from jl]: 

Proposition 1.4. A circulant matrix H G M^(W), written B.^ = 'Jj-i, is almost Hada- 
mard if and only if the following conditions are satisfied: 

(1) The vector q = F*^ satisfies q G T . 

(2) With e = sgn(7), pi = J2 r e rli+r an d v = F* p, we have v > 0. 

Proof. By Lemma 1.3 the orthogonality of U is equivalent to the condition (1). Regarding 
now the condition SU t > 0, this is equivalent to S t U > 0. But: 

{S t H) i j = SkjH k j = '^^Ei-klj-k = £r7j-i+r = Pj-i 
k k r 

Thus S l U is circulant, with p/y/N as first row. From Lemma 1.3 again we get S l U = 
FLF* with L = diag(u) and v = F*p, so >S*£/ > iff v > 0, and we are done. See [I]. □ 

Now, let us investigate the circulant and symmetric orthogonal matrices: 

Lemma 1.5. For a matrix U G Mjv(C), the following are equivalent: 

(1) U is orthogonal, circulant and symmetric. 

(2) U = FQF* with q G {±1}^ satisfying q t = q_ { . 

Proof. It follows from Lemma 1.3 that U is orthogonal and symmetric iff U — FQF*, 
with q G T N satisfying qi = The symmetry condition reads (Fq)i = (Fq)-i which 
translates into the following system of equations, with i — 0, . . . , N — 1: 

J2w ik (q k -q_ k ) = 

k 

This system admits the unique solution q k — q~ k = 0, and the result follows. □ 
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As an example, the vector q = (— l) n (l, —1, 1, . . . , —1, 1, 1, —1, . . . , 1, —1), having length 
N — 2n + 1, produces the following N x N matrix, from [I]: 

/ 1 -cos" 1 ^ cos- 1 ^ ... cos- 1 ^^\ 



ln ~n~ 



N N • • • , N 

_1 (JV-lW -1 7T -1 (N-2)ir 

COS v N ' 1 — COS ... — COS v ; 

\ - cos" 1 § cos" 1 ^ - COS" 1 jf- ... 1 / 



Let us reformulate now the above result in a more convenient form, and gather as well 
some examples. Recall that the integer part of a real number r is denoted \r\ ■ 

Proposition 1.6. There are 2 n circulant symmetric orthogonal N x N matrices, indexed 
via U = FQF* by sign vectors q G {±l} n ; where n = 2 J + 1. The examples include: 

(1) The identity matrix In, coming from q = (1, 1, . . . , 1). 

(2) The matrix Un — 2J N — In, coming from q = (1, —1, — 1, . . . , —1). 

(3) For N even, the matrix Sn = (? J), coming from q = (1, —1, 1, . . . , —1, 1, —1). 

(4) For N odd, the above matrix Ln, coming from q = (— l)^ J (1,-1,1, ... ,—1,1). 

Proof. The first assertion follows from Lemma 1.5, and from the fact that the condition 
qi = q-i is redundant for i = for all N, and for % = N/2 when iV is even. The vector q 
generating the orthogonal matrix is then given by q = (q , qi, q 2 , ■ ■ ■ , (?2> Qi)- 

Regarding now the assertions (1-4), we just have to prove here that the q- vectors in the 
statement produce indeed the matrices in the statement. But this is clear for (1-3), and 
(4) follows as well, via an elementary computation performed in |3] . □ 

Theorem 1.7. The number of orthogonal circulant symmetric matrices (OCS), orthogo- 
nal circulant symmetric matrices with nonzero entries (OCSN) and circulant symmetric 
almost Hadamard matrices (AHM) is as follows: 
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Proof. This follows from a computer implementation^ of the algorithm in Proposition 1.4, 
by using as input the vector q G {±l} n , with n = \_N/2\ + 1, from Proposition 1.6. □ 

Observe the arithmetic dependence of the above numbers with N. However, this depen- 
dence is not exact, so in order to formulate an exact conjecture here, we would probably 
have to take into account certain algebraic geometric multiplicities, as in Haagerup's pa- 
per [T5] . The only observation we can make at this point is that, for prime N, there are 
only two OCS matrices with zero entries, ±ljv- We intend to come back to these questions 
in some future work. 

2. Critical points, color decomposition 

In this section we characterize the critical points of the p-norm on O(N). Our starting 
point, which motivates our study, is the following simple observation from [3]: 

Proposition 2.1. Let U G 0(N), and let p G [l,oo] - {2}. 

(1) Ifp < 2 then \ \U\\ P < N 2 l p ~ 1 / 2 , with equality iff H = y/NU is Hadamard. 

(2) Ifp > 2 then \\U\\ P > N 2 l p ~ l l 2 , with equality iff H = ^/NU is Hadamard. 

Proof. In the case p < 2, the Holder inequality gives: 

\\U\\ P < N 2/p - l \\U\\ 2 = N 2/p - l/2 

Also, in the case p > 2, the Holder inequality gives: 

||t/|| P >iV 2 / p - 1 ||£/|| 2 = iV 2/p - 1/2 

In both cases the equality holds when all the numbers \Uij\ are proportional, and we 
conclude that we have equality if and only if = 1/y/N, as stated. □ 

Observe that at p = l,4,oo we obtain < N^N, \\U\\ 4 > 1, ||f/||oo > l/VN, in 

all cases with equality if and only if the rescaled matrix H = yNU is Hadamard. 

Definition 2.2. A matrix H G Mjv(M) such that U = H/yN is orthogonal is called: 

(1) p-almost Hadamard (p < 2), if U locally maximizes the p-norm on O(N). 

(2) p-almost Hadamard (p > 2), if U locally minimizes the p-norm on O(N). 

As a first remark, given an exponent p ^ 2 and a number iV G {2} U 4N where 
the Hadamard Conjecture holds, the Hadamard matrices of order N are precisely the 
p-almost Hadamard matrices H G Mjv(M) which are "optimal", in the sense that the 
rescaled matrix U = H/yN is a global maximum/minimum of the p-norm on O(N). 

Let us try now to understand the critical points of the various p- norms on O(N). 
Consider the set O(N)* C O(N) of orthogonal matrices having nonzero entries. Given 
a function <p G C 1 (0,oo), the associated function F(U) = ^(Wijl) is differentiable 
around each U G 0(N)*, and the critical points of F can be found as follows: 



Source code available at |http://www. irsamc.ups-tlse.fr/inechita/code/ocsn-ahiri.zip 
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Lemma 2.3. For U G O(N)* and tp G C 1 (0, oo), the following are equivalent: 

(1) U is a critical point of F(U) = v(l^yl)- 

(2) WU t is symmetric, where Wij = sga(Uij)(p' (\Uij\) . 

Proof. We follow the proof in [3], where this result was established for <£>(x) = x. We 
know that the group O(N) consists of the zeroes of the following polynomials: 

Aij = ^ UikUjk — o~ij 

k 

Also, U is a critical point of F iff dF G span(dAij). Now since A^ = A^, this is the 
same as asking for a symmetric matrix M such that dF = ^ . MijdAij. But: 

^ MydA^ = M i3 (U ik dU jk + Z7 ife rfZ7 ife ) = 2 ^{MU) lk dU lk 

ij ijk Ik 

On the other hand, with SV,- = sgn(C/y), we get: 

dF = Y,d{ V {S l3 U i3 )) = Y,Si 3 v'{S l3 U l3 )dU i3 •,///;, 

ij ifc ij 

We conclude that U is a critical point of F iff there exists a symmetric matrix M such 
that W = 2MU. Now by using the assumption U G O(N), this condition simply tells us 
that the matrix M = WU t /2 must be symmetric, so we are done. □ 

In order now to investigate the symmetry property of the matrix WU l appearing in the 
above statement, we use the following notion: 

Definition 2.4. The color decomposition of U G O(N)* is U = Y^ r >o r ^^' where: 

Mr) = Ugn(Uij) ii\Uij\=r 

The matrices G Mjv(— 1,0, 1) will be called "color components" ofU. 

If we let Sij = sgn([/jj), then for any if) : (0, oo) — > M we have the following useful 
formula, that we will use many times in what follows: 

r>0 

Let us investigate now the critical points of all p- norms on O(N): 

Theorem 2.5. For U G O(N)* , the following are equivalent: 

(1) U is a critical point of the p-norm on O(N), for any p G [1, oo]. 

(2) U is a critical point of F{U) = <p(\Uij\), for any <p G C 1 (0, oo). 

(3) WU l is symmetric for any if) : (0, oo) — >■ K ; where = sga(Uij)if)(\Uij\) . 

(4) U^U 1 is symmetric for any r > 0, where U^) are the color components ofU. 



ALMOST HADAMARD MATRICES: THE CASE OF ARBITRARY EXPONENTS 11 

Proof. The result basically follows from Lemma 2.3: 

(1) -<=>- (2) In one sense this is trivial, because it suffices to choose the continuously 
differentiable functions <p(x) = x p . In the other sense, this follows from the fact that the 
functions ip(x) = x v span a dense subalgebra of C 1 (0, oo). 

(2) -<=>- (3) This follows from Lemma 2.3, because the condition found there is purely 
algebraic, and hence doesn't depend on the fact that if) = tp' is continuous. 

(3) ^=r- (4) We have the following formula: 

(WU t ) ij = Y,*&(U ik )i/>(\U ik \)U jk = ^il>(r) ^(U lk )U 3k 

k r>0 k,\U ik \=r 

In terms of the color components of U, this formula becomes: 

(W% = J>(r) "£u^U jk = ^(r)(U^U% 

r>0 k r>0 

Thus the matrix appearing in (2) is simply given by: 

WU* = J^ip^U^U* 

r>0 

Now since if) : (0, oo) — > M can be here any function, the result follows. □ 

As a first consequence, we have: 

Corollary 2.6. Let U = U(x,y) be orthogonal, coming from an (a,b,c) pattern. Then U 
is a critical point of all the p-norms on O(N). 

Proof. As explained in jl] the fact that U is orthogonal shows that x, y have opposite 
signs, and we will make the same normalization as there, namely x < 0, y > 0. 

Consider now the matrices U x , U y G M^(0, 1) describing the positions where our vari- 
ables x, y sit inside U. Then we have the following formulae: 

UW = -U X , U {y) =U y , U = xU x + yU y , U x + U y = NJ N 
By using these formulae we obtain that U^U 1 is indeed self-adjoint: 

UMU* = -U^xUi + yUl) 

= -U^xUi + yiNJx-Ui)) 
= {y-x)U x Ul-yNJ N U x 
= {y-x)U x Ul-y{a + b)NJ N 
A similar computation shows that U^U 1 is self-adjoint as well, and we are done. □ 
We have as well the following consequence: 

Corollary 2.7. Any circulant and symmetric matrix U G O(N) having nonzero entries 
is a critical point of all p-norms on O(N). 
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Proof. For a color r > 0, consider the set of indices where this color appears on the first 
row, D r = {k\\^ k \ = r}. From ji = 7_j we get D r = —D r , and so: 

k seD r teD r 

This shows that U^U 1 is symmetric, and we are done. □ 

The above corollary can be regarded as a slight advance on a key problem raised in [I], 
namely that of characterizing the circulant almost Hadamard matrices. 
We have as well the following question, that we believe of interest: 

Problem 2.8. What are the matrices U G O(N)* having the property that U^ r \U^Y is 
symmetric for any r,s, where U = X] r >o r ^^' ) ^ s c °l° r decomposition? 

The point is that all the examples of joint critical points of all p- norms on O(N) that we 
have, namely the rescaled Hadamard matrices, the matrices coming from (a, b, c) patterns, 
and the circulant and symmetric matrices, satisfy in fact this stronger condition. 

Observe also that the condition in Problem 2.8, involving just —1,0,1 matrices, is 
purely combinatorial. In fact, what we have there is an axiomatization of some new 
"design-type" combinatorial structure, generalizing the Hadamard matrices. 

3. Local extrema, the rotation trick 

In this section and in the next one we find an algebraic criterion for detecting the 
p-almost Hadamard matrices, by building on the previous work in [3J at p = 1. 

The result will basically come from the computation of the Hessian of the p-norm on 
O(N). However, since this p-norm is in general not differentiable at points U G O(N) 
having zero entries, we first must prove that the local extrema belong to O(N)*. 

At p = 1 this was done in [3], by using a "rotation trick". The same trick works in fact 
at any p < 2, but with some more calculus needed afterwards, and we have: 

Theorem 3.1. If U G O(N) is a local maximum of the p-norm on O(N), for some 
exponent p G [1,2), then U G O(N)*. 

Proof. Let Ui, . . . , Un be the columns of U, and let us perform a rotation of Ui, U2: 

\Ul) ~ ^sint • Ui + cost • U 2 ) 

In order to compute the p-norm, let us permute the columns of U, in such a way that 
the first two rows look as follows, with ^ 0, ^ 0, A^Ck > 0, BkD k < 0: 

fU x \ _ (0 Y A B\ 
\U 2 ) ~ [0 X CD) 
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Let us compute now the following quantity: 

<p(t) = n-\\u\\ p 



p ll~ Up 

p 



= || cos* • Z7i — sin* - 1/ 2 ||^+ || sint - C/x + cost • C/ 2 ||^ - ||£/i||£- \\U 2 \ ip 
We have the folowing formula: 

ip(t) = \\smt-X\\ p p + || cost -Y\\ p p + || cost • A -sint -C\\ p p + || cost sint • 

+ ||cost-X||£ + ||sint-y||£+ || sint- A + cost ■ C\\ p p + || sint • B + cost • D\\ p p 



- \\X\\l-\\Y\\ p p -\\A\\ p p -\\B\\ p p -\\C\\ p p -\\D\\ p p 
Thus for t > small we have: 

<p(t) = (sin p t + cos p t-l)(\\X\\ p p +\\Y\\ p p ) 

+ \\ cost -A-sint-Clll+ll sint- A + cost- C\\ p p -\\A\\ p p -\\C\\ p p 
+ || cost B- sint- D\\* + || sint B + cost • D\\ p p - \\B\\ p p - \\D\\ p p 
Now by remembering our conventions A k C k > 0, B k D k < 0, we obtain: 
<p(t) = (sinn + cos^-l)(||X||^+||F||p 

+ ^(cost|A fe | - sint|C fc |) p + (cost|L7 fc | + sint|A fe |) p - \A k \ p - \C k \ p 
k 

+ ^(cost|5 fc | + sint\D k \) p + (cost\D k \ - sint|5 fe |) p - \B k \ p - \D k \ p 

k 

Consider now the matrix V obtained by interchanging U\,Ui- If we perform to it a 
rotation as above, then the quantity ip(t) = ||V*||£ — ||V||£ is given by: 

^(t) = (sin p t + cos p t-l)(||X||^+||F|p 

+ ^(cost|C7 fc | - smt\A k \) p + (cost|A fe | + sint|C fc |) p - |A fc |" - |C fc |* 
k 

+ ^{cost\D k \ + sintl^lf + (costal - s\ut\D k \) p - \B k \ p - \D k \ p 

k 

Let us introduce now the following function depending on a, c > 0: 

7t(a, c) = (cost ■ a + sint • c) p + (cost • c + sint • a) p 
+ (cos t ■ a — sin t ■ c) p + (cos t • c — sin t • a) p 
- 2a p -2c p 

With this notation, if we sum the above two formulae of tp, ip, we obtain: 
<p{t)+1>{t) = 2(sin p t + cos p t-l)(||X|| p +||F|p 

+ ^2'Yt(\A k \,\C k \) + Y / 'Yk(\B k \,\D k \) 
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Now observe that the derivative of this quantity is given by: 

if'(t)+ijj'(t) = 2p(sin p - 1 t cost- cos?- 1 1 sin t)(\\X\\ p p + \\Y\\ p p ) 

+ Y,rt\MACk\) + Y,i , k(\Bk\,\Dk\) 

k k 
So, let us compute now the derivative of j t ' 

7 t '(a, c) = p(cost • a + sint ■ c) p_1 (— sint • a + cost • c) 

+ (cos t • c + sin t ■ a) p_1 (— sin t ■ c + cos t ■ a) 

+ (cos t ■ a — sin t ■ c) p (— sin t ■ a — cos t ■ c) 

+ (cos t ■ c — sin t ■ a) p (— sin t ■ c — cos t ■ a) 

By using sint = t + 0(t 2 ) and cost = 1 + 0(t 2 ) we obtain: 

%{a,c) ~ p(a + tc) p_1 (c-ta) +p(c + ta) p_1 (a - tc) 
- p(a - tc) p_1 (c + ta) - p(c - ta) p_1 (a + tc) 

By using the power series expansion for the exponentials, this gives: 

it^A. ~ ( a P-i + ( p _i) a P-2 tc )( c _ ta ) + ( c P-i + ( p _ 1 ) c P-2 ta)(a _ te) 

- (a p - x - (p - l)a p - 2 tc)(c + ta) - - (p - l)c p - 2 ta)(a + tc) 
The order terms cancel, and by neglecting the order 2 terms we obtain: 

P 

- (a p - (p - l)a p ~ 2 c 2 )t - (c" - (p - iy- 2 a 2 )t 
Now since the upper and lower terms are the same, we obtain: 

= (p - l)(a p - 2 c 2 + a 2 c p - 2 ) - (a p + c p ) 

With these formulae in hand, we claim that X, Y both follow to be null vectors. Indeed, 
since we are in the case p G [1, 2), the matrices U, V are local maximizers of the p-norm. 
Thus if, ip < for t > small, so we must have if' + < for t > small. But: 

f'{t) + i/(t) = 2pt^ (| \X\ | p + | \Y\ | p ) + 0(t) 

Thus we have ||X|| P + ||F|| P < 0, and so X,Y are both null vectors, as claimed. 

Summarizing, we have proved that the entries of U\,U2 must appear at the same 
positions. By permuting the rows of U the same must hold for any two rows Ui, Uj. Now 
since U G O(N) cannot have zero columns, all its entries must be nonzero, as claimed. □ 
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It is not clear whether the same holds at p G (2, oo). Here U, V are local minimizers of 
the p-norm, so ip, ip > for t > small, so (p' + ip' > for t > small. But: 

ip\t)+tl>\t) = -2pt{\\X\\ x + \\Y\\ 1 ) + 2ptS ABC D + 0(t 1+e ) 

Here Sabcd is a sum of quantities of the following type, one for each pair of adjacent 
entries of A, C, and one for each pair of adjacent entries of B, D: 

K(a, c) = (p - l)(a p - 2 c 2 + a 2 c p - 2 ) - (a p + c p ) 

The problem comes from the fact that these quantities, and hence their sum Sabcd as 
well, can be positive, so that we cannot conclude that we have \\X\\ P + \ \Y\\ P < 0. 

The case p = oo is also very problematic, because when the maximum M = max \ Uij\ 
appears at many places in our matrix, the rotation trick obviously cannot work. In fact, 
there are many problems here, and the rotation trick at p = oo seems to require precise 
information about the positions of the M and entries in our matrix. 

Of course, the fact that the rotation trick might fail at p G (2, oo] is not an indication 
that the conclusion U G O(N)* should fail itself, but just of the fact that the good rotation 
U* = Ue tA might come from more complicated antisymmetric matrices A G Mjv(R). 

Here is an example of such a result, excluding a few matrices having zero entries: 

Proposition 3.2. An antisymmetric matrix A G O(N) cannot be a local extremum of the 
p-norm on O(N), for any p > 1. 

Proof. Since A is orthogonal and antisymmetric, we have A 2 = —A A 1 — — 1, and so: 

e tA = cos t - A — sin t ■ 1 n 

We analyze, to the first order in t — > 0, the following function: 

\\Ae tA \\ p p - \\A\\ p p = (\cost\ p - l)||,4||£ + iV|sint| p 

At p < 2 this function behaves like N\t\ p , so A cannot be a local maximum for the 
p-norm, since it is a local minimum in the direction A. Similarly, at p > 2 the norm 
difference behaves like — p\t\ 2 /2, so A cannot be a local minimum for the p-norm. □ 

4. The Hessian formula, open problems 

In this section we find an algebraic criterion for detecting the p-almost Hadamard 
matrices. For this purpose, let us first go back to Theorem 2.5 above, and introduce: 

Definition 4.1. To any U G O(N)* we associate the matrices L r = U^U 1 and R r = 
U^^, where U = Xlr>o r ^ <r ' ) ^ s ^ e c °l° r decomposition of U . 

According to Theorem 2.5 above, in the case where U is a critical point of all the p- 
norms on O(N), the matrices L r are all symmetric, and the matrices R r = U l L r U follow 
to be symmetric too. Observe also that we have the following formula: 

r>0 r>0 
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We now study the local extrema of the p-norm on O(N)*. We use: 
Lemma 4.2. Let U G O(N)*, let p G [l,oo), and for A G M N (M) antisymmetric, set: 
<p(A) = J2 WijrMp - l)(UA)l + £^(£M%) 

TTien U is a local maximizer / minimizer of the p-norm iff J2 r>0 r p ~ 1 Tr{R r A t ) = 0, and 
the quantity (p(A) is positive/negative, for any A G Mjv(R) antisymmetric. 

Proof. Since the Lie algebra of SO(N) consists of the antisymmetric matrices A G Mjv(R), 
in the neighborhood of U G O(N) we have matrices of type Ue tA , with A antisymmetric, 
and with t G R close to 0. So, let us fix A G Mjy(M) antisymmetric, and set: 

f(t) = \\Ue tA \\l 

With S'jj = sgn(C/jj), for t G R close enough to we have: 

/(o = e \( UetA h\ p = E^^^r 

Now the derivative of this function with respect to t is given by: 

fit) = j2p\( UetA hr ls ^ Uet % 

ij 
ijfc 

= E^^i(^^rv% 

ijk 

In particular at t = we obtain the following quantity, whose vanishing corresponds to 
the first condition in the statement: 

= pE^i^r 1 ^)^ 
= pE cr_lTr (^ A< ) 

r>0 

Also by using the above formula, let us compute now the second derivative: 

ijfc 
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At t — now, by using (e tB )j t=0 = B for any B G M N (R), we get: 

i j k 

= Y.P U ^\Uij\ p - 2 {{p ~ l){UA) l3 A k] + Uij(A 2 ) k j) 

ijk 
ij 

Thus we have /"(0) = pip(A), and this gives the result. □ 

Theorem 4.3. A matrix U G O(N)* is a local maximizer /minimizer of the p-norm on 
O(N), with p G [1, oo), if and only if the matrix J2 r >o rP ~ 1 Lr is symmetric, and with 



Yab,cd = $bd^2((p- 
i 


-i) 


\u ib r 2 - 


- \u tc \ p 


) Ui a U%c 


i 


-i) 


\u ib r 2 - 


- \u id \ p 


2 )Ui a Uid 


i 


-i) 


\u ia r 2 - 


- \u lc \ l 


2 )UibU ic 




-i) 


\u ia r 2 - 


- \u ld \ l 


2 )u ib u id 



i 



the quadratic form ip = J2 abcd y a b,cdBabB cd is positive/negative. 

Proof. Let us look at the two conditions found in Lemma 4.2. The first condition, namely 
that we have ^ r>0 r p ~ 1 Tr(R r A t ) = for any A G M N (R) antisymmetric, is equivalent to 
the first condition in the statement, namely that the matrix 'Yl ir> Q^ p l L r is symmetric. 
The quantity found in Lemma 4.2 can be written as: 

v = Y.\ u ^r 2 (ip- l )( UA %+ u iAuA 2 ) ij ) 

= Yl \ U »\ P ~ 2 ((P - ^UikUaAkjAij + UijUikAkiAj) 

ijkl 

= E \ U ^\ P '\P - l)U lk U a A k3 A l3 - \U u \ p - 2 U u U lk A kl A l:j 

ijkl ijkl 

= ]T((p - l)|^r 2 - \U u \ p - 2 )U ik U u ■ A k3 A l3 

ijkl 
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Now recall that A e M^(M.) is an arbitrary antisymmetric matrix. So, let us write 
A = B — B f . In terms of the matrix B G Mjy(M), which can be arbitrary, we have: 

V = - ^l^r 2 - \Uii\ p ~ 2 )U ik Uu{B kj - B jk )(B l:j - B jt ) 

ijkl 

The expression on the right in the above formula is: 
X = (B kj — B jk )(Bij — Bji) 

= ^^i$kj,ab — 8jk,ab)(8lj,cd ~ Sjl,cd)B ab B cd 
abed 

= ^~](8bd5jkl,bac — 8bcSjkl,bad ~ ^ad^jkl,abc + &ac&jkl,abd)B ab B cd 
abed 

It follows that our map ip is given by: 

up = j]s a6J B C(i j]((p-i)|c/yr 2 -i^r 2 )c/ ife ^ 

abed ijkl 

($bd$jkl,bac ~ $bc$jkl,bad ~ Kd^jkl,abc + &ac&jkl,abd) 

But this gives the formula in the statement, and we are done. □ 

Observe that at p = 2 we have p = 0. Also, in the case where the rescaled matrix 
H = \/NU is Hadamard, the rescaled matrix Y = (\/N) p ~ 2 Y is given by: 

Yab,cd = (P ~ l)$bd ^ UjgUic ~ (P ~ l)$bc ^ UjgUid ~ &ad ^ UjbUjc + $ac ^ UjbUjd 

i i i i 

= ip - l)S bd S ac - ip - l)S bc S ad - 5 ad 5 bc - S ac S bd 
= (p-2) i5 ac 5 hd - 5 ad 5 bc ) 

Thus the rescaled quadratic form p> = i\fN) p ~ 2 p is given by: 

Pab,cd = ip — 2) ^^i^achd — S ad 5 bc )B ab B cd 

abed 

= ip-2)J2iB 2 ab -B ab B ba ) 

ab 

= iP-2)- l -Y,^- B ^) 2 

ab 

These computations agree of course with the fact that the 2- norm is constant on Oat, 
and that the multiples of Hadamard matrices are p-almost Hadamard, for any p. 

Observe that some simplifications appear as well at p — 1. Here we obtain of course 
the fact that the matrix SU l must be positive, as stated in Definition 1.1 above. 

In general, the formula in Theorem 4.3 is quite a theoretical one, but can be used on a 
computer. As an example of potential application, our computer simulations suggest: 
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Conjecture 4.4. K N is p-almost Hadamard, for any N and p. 

Regarding a possible direct proof, let Un = K N /\/N, and observe first that for U G 
O(N) we have NJ^U = (Sj)ij, where Si are the sums on the columns of U, so: 

2 

(U N U)ij = —Sj - Uij 
Thus, we have the following formula for the p-norm of a perturbation of Un' 

\\UnU\\1 = J^luu-^s^ 

ij 

The problem is to prove that this quantity is locally minimized/maximized at U = ljv- 
This looks like a quite tricky problem, and we don't have results. 

We have as well a series of questions concerning some possible extensions of this con- 
jecture. We know from Corollary 2.6 and from Corollary 2.7 that both classes of matrices 
"coming from designs" and "circulant and symmetric" are critical points of all p-norms. 
We believe that the good framework is the "circulant design" one, and we have: 

Problem 4.5. Consider the matrices in O(N) coming from circulant designs. 

(1) What are these matrices, combinatorially speaking? 

(2) Which of these matrices have nonzero entries ? 

(3) When are these matrices p-almost Hadamard? 

In relation with question (1), one remark is that the Fano plane matrix is indeed 
circulant, so the answer to the problem is certainly not trivial. Question (2) looks easy 
but is probably not entirely trivial, because we have to exclude here for instance the 
identity matrix 1^. As for (3), this is definitely not trivial, among others because an 
answer here would probably require a serious combinatorial input, coming from (1). 
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